A Critical Survey of the Methodology for IE Evaluation

Authors

  • Alberto Lavelli
  • Mary Elaine Califf
  • Fabio Ciravegna
  • Dayne Freitag
  • Claudio Giuliano
  • Nicholas Kushmerick
  • Lorenza Romano
Abstract

We survey the evaluation methodology adopted in Information Extraction (IE), as defined in the MUC conferences and in later independent efforts applying machine learning to IE. We point out a number of problematic issues that may hamper the comparison between results obtained by different researchers. Some of them are common to other NLP tasks: e.g., the difficulty of exactly identifying the effects on performance of the data (sample selection and sample size), of the domain theory (features selected), and of algorithm parameter settings. Issues specific to IE evaluation include how leniently to assess inexact identification of filler boundaries, the possibility of multiple fillers for a slot, and how the counting is performed. We argue that, when specifying an information extraction task, a number of characteristics should be clearly defined; in published papers, however, only a few of them are usually specified explicitly. Our aim is to elaborate a clear and detailed experimental methodology and propose it to the IE community. The goal is to reach widespread agreement on such a proposal so that future IE evaluations will adopt the proposed methodology, making comparisons between algorithms fair and reliable. To achieve this goal, we will develop and make available to the community a set of tools and resources that incorporate a standardized IE methodology.
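To illustrate why the treatment of inexact filler boundaries matters, the following minimal Python sketch scores the same set of predictions under two matching criteria. It is not the methodology proposed in the paper: the span representation, the function names (`exact_match`, `overlap_match`, `precision_recall`), the greedy one-to-one counting, and the example data are all assumptions made for illustration.

```python
from typing import Callable, List, Tuple

# Hypothetical representation: a filler is a (start, end) token span, end exclusive.
Span = Tuple[int, int]


def exact_match(pred: Span, gold: Span) -> bool:
    """Strict criterion: both boundaries must coincide."""
    return pred == gold


def overlap_match(pred: Span, gold: Span) -> bool:
    """Lenient criterion (an assumption): any token overlap counts as correct."""
    return pred[0] < gold[1] and gold[0] < pred[1]


def precision_recall(
    preds: List[Span],
    golds: List[Span],
    match: Callable[[Span, Span], bool],
) -> Tuple[float, float]:
    """Greedy one-to-one counting: each gold filler can be credited at most once."""
    unmatched = list(golds)
    true_positives = 0
    for pred in preds:
        for gold in unmatched:
            if match(pred, gold):
                unmatched.remove(gold)  # safe: we leave the inner loop right away
                true_positives += 1
                break
    precision = true_positives / len(preds) if preds else 0.0
    recall = true_positives / len(golds) if golds else 0.0
    return precision, recall


# Made-up example: the second prediction starts one token too early.
gold_fillers = [(3, 6), (10, 12)]
predicted_fillers = [(3, 6), (9, 12), (20, 22)]

exact_scores = precision_recall(predicted_fillers, gold_fillers, exact_match)
lenient_scores = precision_recall(predicted_fillers, gold_fillers, overlap_match)
print("exact:  ", exact_scores)
print("lenient:", lenient_scores)
```

On this made-up data, the single off-by-one boundary moves recall from 0.5 under the exact criterion to 1.0 under the lenient one, so two papers reporting results under different (unstated) criteria would not be directly comparable.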


Publication date: 2004